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ABSTRACT 

A  filtering  and  data  fusion  technique  is  described  which  uses  the  correlation  between 
the  two  data  streams  of  a  dual-frequency  sidescan  sonar  in  order  to  discriminate 
against  noise  and  preferentially  enhance  features  of  desired  relative  size.  The  filter  is 
applied  to  data  collected  with  a  Klein  590  dual-frequency  sidescan  sonar  with  the 
parameters  adjusted  so  as  to  enhance  the  display  of  minelLke  objects.  Experience  with 
the  filter  shows  that  it  is  highly  effective  in  noisy  conditions. 
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Executive  Summary 


The  Klein  590  sidescan  sonar  which  is  incorporated  in  the  Royal  Australian  Navy 
(RAN)  Route  Surveillance  System  (RSS)  simultaneously  returns  two  streams  of  data 
(sonograms),  with  identical  data  rates,  corresponding  to  two  sonar  operating 
frequencies,  100  kHz  and  500  kHz.  Each  sonogram  corresponds  to  a  two-dimensional 
acoustic  "map"  of  the  seabed  and  corresponding  pixels  in  each  map  correspond  to 
areas  on  the  seabed  with  the  same  centre,  but  generally  not  the  same  size.  The  beams 
of  the  sonar  also  have  different  characteristics  for  the  two  operating  frequencies  and 
separate  electronic  circuits  to  generate  and  receive  them.  There  is  a  high  degree  of 
correlation  between  the  seafloor  "signal"  component  of  the  two  data  streams,  but  noise 
inputs  tend  to  be  less  well  correlated  between  the  two  frequencies.  This  report  details  a 
method  of  "data  fusion"  based  on  a  modification  to  the  correlation  coefficient 
expression  for  a  linear  regression,  which  when  applied  to  the  two  sonograms,  results 
in  a  third,  "merged"  sonogram  (and  corresponding  "map"  of  the  sea  floor)  which 
dicriminates  against  noise  in  favour  of  signals  from  objects  of  interest.  In  particular, 
the  technique  may  be  adjusted  so  as  to  discriminate  in  favour  of  objects  of  minelike 
sizes  and  to  reject  bottom  "clutter",  surface  reverberation  noise  and  other  sources  of 
noise  such  as  electrical  interference. 

In  this  report,  the  algorithm  used  to  produce  merged  sonograms  is  discussed.  A 
modification  of  the  correlation  coefficient  for  a  linear  regression  is  shown  to  be 
superior  to  the  correlation  coefficient  in  the  determination  of  "correlation"  between  the 
two  original  sonograms  and  some  suggestions  are  given  as  to  why  such  a  technique 
should  be  successful.  The  implementation  of  the  merging  algorithm  as  a  real-time 
process  suitable  for  routine  use  during  data  collection  or  post-processing  is  also 
discussed.  Some  example  images  are  shown  which  give  an  indication  of  the 
components  of  the  process  and  the  usefulness  of  the  algorithm.  Following  the 
discussion  of  the  algorithm,  an  attempt  is  made  to  quantify  the  usefulness  of  the 
algorithm  by  the  definition  of  a  "local  signal  to  noise  ratio"  which  gives  a  measure  of 
how  much  a  given  minelike  object  can  be  expected  to  stand  out  from  the  backgrotmd 
image  clutter.  Such  a  local  signal  to  noise  ratio  has  been  evaluated  for  hundreds  of 
mineHke  objects,  and  a  comparison  is  made  of  the  values  recorded  from  both  original 
sonograms  (100  and  500  kHz)  and  from  the  merged  sonogram.  The  values  from  the 
merged  sonogram  are  shown  to  be  consistently  superior  to  those  from  the  original 
sonogram,  usually  by  factors  of  the  order  of  6  dB  for  the  median  scores.  The 
superiority  of  the  appearance  of  mineUke  objects  in  the  merged  sonograms  appUes 
regardless  of  range  from  the  sonar  and  weather  conditions. 

In  summary,  the  merging  algorithm  described  in  this  report  appears  to  be  robust  and 
useful  in  routine  operations  where  dual-frequency  sidescan  sonar  data  is  available  and 
the  location  of  minelike  objects  is  of  interest. 
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1.  Introduction 


The  majority  of  sidescan  sonars  used  for  the  Route  Surveillance  System  (RSS)  of  the 
Royal  Australian  Navy  (RAN)  are  a  dual-frequency  device,  the  Klein  System  590, 
which  simultaneously  outputs  sonogram  data  for  two  operating  frequencies,  100  kHz 
and  390  kHz,  known  however  as  "500  kHz"  in  the  sonar  industry  and  hereafter.  The 
two  data  streams  are  highly  correlated  in  space  and  time  -  the  beams  are  emitted  from 
parallel  transducer  elements  separated  by  less  than  25  millimetres  in  space  and  by  60 
ps  in  time,  far  shorter  than  significant  transducer  motion  time-scales.  The  beams 
emitted  by  the  transducers  share  a  common  central  plane  and  only  differ  in  their 
spreading  characteristics,  and  in  their  attenuation  with  range.  Because  of  the  high 
degree  of  correlation  between  return  signals  from  the  two  beams  inherent  in  this  sonar 
system,  it  is  possible  to  discriminate  against  noise  by  combining  the  data  from  the  two 
channels  to  form  a  single  sonogram.  It  is  often  an  observable  feature  of  dual-frequency 
sonograms  that  the  noise  characteristics  of  the  two  frequencies  are  quite  different  -  one 
frequency  may,  for  instance,  be  subject  to  surface  reverberation  while  the  other  may  be 
degraded  by  electrical  interference.  If  this  "merged"  sonogram  is  of  sufficient  quality, 
it  should  be  sufficient  to  replace  the  two  original  sonograms  and  provide  the  default 
interface  during  processing  of  route-survey  or  sidescan-based  minehunting  data,  or,  at 
worst,  it  should  provide  a  useful  adjimct  to  die  images  associated  with  the  individual 
sonar  frequencies. 

To  date,  only  a  single  implementation  of  a  "merging"  process  is  known  to  the  authors. 
The  "Data  Fusion"  ©  process  implemented  by  EG&G  Marine  Instruments  Corporation 
in  their  DF-1000  digital  dual-frequency  (100-500  kHz)  sidescan  sonar  system  is 
directed  at  making  the  best  use  of  the  range  and  resolution  characteristics  of  each 
sonar  frequency.  The  sonar  operator  sets  a  range  control,  and  for  slant  ranges  below 
the  control  value,  the  higher-resolution  500  kHz  data  is  displayed.  At  higher  ranges, 
the  proportion  of  the  100  kHz  data  is  increased  to  coimter  the  attenuation  of  the  500 
kHz  data  and  for  ranges  greater  than  a  second  range  value,  only  100  kHz  data  is 
displayed.  This  process,  which  may  be  of  value  in  itself,  does  not  attempt  to  use  the 
correlation  between  the  two  sonograms  in  order  to  improve  the  image. 

In  this  report,  a  merging  algorithm  which  appears  to  be  sufficiently  robust  for  use  in 
routine  route-survey  data  inspection,  and  sufficiently  simple  to  be  implemented  as  a 
real-time  process  operating  in  the  background  of  a  display  or  acquisition  program,  is 
discussed.  The  filter  was  developed  empirically  and  represents  the  best  of  many 
attempts.  Optimal  filter  parameters  have  been  determined  empirically  and  these  are 
also  chscussed.  A  numerical  measure  of  the  effectiveness  of  the  filter  in  revealing  the 
presence  of  minelike  objects  against  a  noisy  background  is  developed  and  the 
performance  of  the  filter  is  assessed  using  the  measure  appHed  to  filtered  output  from 
data  collected  in  sea-trials  in  Jervis  Bay. 
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2.  Filter  implementation 

2.1  Determination  of  correlation  between  sonograms 

Consider  two  sonograms  U  and  V  of  the  same  scene,  collected  simultaneously  with 
beams  of  different  frequency.  Assume  that  the  sonograms  consist  of  discrete  arrays  of 
samples  and  vlh,^]  where  h  corresponds  to  the  /ith  pulse  recorded  and  the  /zth 

row  of  tine  sonogram  displayed  as  an  unage,  and  k— 0,1,...N“1  describes  the  kth  time  bin 
and  corresponds  to  the  kth  column  of  the  image.  Each  ordered  pair  [A,k]  labels  a 
particular  location  in  both  sonograms,  and  labels  two  physical  regions  on  the  sea  floor 
which  have  the  same  centre,  although  their  size  will  in  general  be  different  because  of 
the  different  widths  of  the  sonar  beams.  The  sampling  regime  does  not  vary  from 
pulse  to  pulse  and  is  the  same  for  both  frequencies,  so  that  h  and  k  may  be  treated  as 
independent  variables. 

Let  S  be  a  subset  of  M  locations  in  the  sonogram,  say  S  =  f  =  O^v-Af-l} . 

The  shape  of  the  region  chosen  by  S  is  arbitrary,  but  it  could,  for  example,  describe  a 
closed  area  on  the  sea  bed,  say  a  rectangular  or  circular  patch. 

Note  that  there  are  some  assmnptions  inherent  in  the  above  definitions.  As  noted,  h 
and  k  are  independent  variables,  since  the  returns  from  aU  pulses  are  sampled  Ihe 
same  way.  In  addition,  at  the  frequencies  of  interest,  the  seawater  medimn  may  be 
taken  to  be  linear  and  non-dispersive,  which  means  that  for  each  pulse,  the  same  array 
of  sampling  time  bins  can  be  used  for  both  frequencies.  As  noted  in  the  introduction, 
the  pulse  at  one  frequency  is  generated  before  the  other  one  and  the  sampling  times  for 
the  second  frequency  are  delayed  correspondingly.  In  the  absence  of  platform 
motions,  therefore,  each  pair  of  samples  [h,k]i  corresponds  to  a  pair  of  physical  areas 
on  the  sea  bed  with  tixe  same  centre.  In  practice,  the  interval  between  the  pulse 
generations  is  small  enough  in  comparison  to  the  time-scales  for  sonar  platform 
motion,  that  beam  motion  may  be  neglected. 

The  filtration  described  in  this  report  depends  on  two  properties,  neitiier  one  being 
sufficient  by  itself;  when  the  region  S  contains  an  object  of  interest  or  a  shadow: 

a)  The  absolute  value  of  the  divergence  of  the  signal  level  in  S  from  the 
backgroimd  level  must  be  large  in  at  least  one  of  the  sonograms;  and 

b)  There  must  be  a  correlation  between  the  signal  levels  u[h,k]i  and  v[h,k]i  in  S, 
such  that  when  condition  "a”  is  true,  then  the  divergence  of  the  signal  level  in 
the  other  sonogram  away  from  the  backgroimd  level  is  also  non-negligible. 

A  necessary,  additional  condition  placed  upon  the  filter  was  that  it  must  be  able  to 
perform  satisfactorily  when  applied  to  sonar  data  which  contain  significant  across- 
track  variations  in  background  signal  level.  Side  scan  sonars  use  time-varying  gain 
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(TVG)  algorithms  to  correct  for  such  across-track  effects,  but  it  is  not  unusual  to  find 
residual  variability  due  to  sHght  mismatches  between  the  sonar's  TVG  algorithm  and 
the  actual  across-track  signal  structure. 

2.1.1  Use  of  the  correlation  coefficient  for  a  linear  regression 

A  method  is  now  required  in  order  to  discriminate  between  correlated  and 
uncorrelated  regions  of  the  sonograms.  Let  the  two  data  sets  in  the  region  S  be 
abbreviated  by  a:,-  =  u[h,/c].  and  y,.  =  v[;i,A:],..  Bothhave  the  character  of  positive  semi- 
definite  random  variables.  If  the  region  S  is  mostly  occupied  by  an  object  of  interest  or 
a  shadow,  then  property  "b"  above  implies  that  there  should  be  a  strong  relationship 
between  the  two  data  sets.  The  simplest  relationship  is  a  hnear  one  of  the  form 

y.  »mxi  +  c  (2-1) 

where  m  and  c  will  depend  on  factors  Hke  the  gain  adjustments  of  the  sonar.  This  is 
simply  saying  that  an  increase  in  the  relative  signal  level  in  one  sonogram  due  to  a 
scatterer  is  likely  to  be  accompanied  by  a  proportional  increase  in  the  other  sonogram. 
The  fimdamental  measure  of  linear  correlation  between  the  data  sets  is  given  by  the 
raw  cross-correlation 


I  M-I 


(2.2) 


which,  since  r,  and  y,  are  random  variables,  should  peak  when  they  are  well- 
correlated.  However,  by  itself,  the  cross-correlation  is  not  a  useful  measure,  because  it 
is  not  normalised.  It  is  better  to  use  the  correlation  coefficient  for  a  linear  regression,  r, 
which  is  defined  by 


r  = 


[(<x"  >-<jr>^)(</  >-<3;>^)] 
where  the  notation  <  /  >  refers  to  the  mean  value 

1  M-l 


(2.3) 


(2.4) 


The  correlation  coefficient  r  will  have  a  value  between  -1  and  1.  The  value  r  =  +1 
occurs  when  x,  and  y,  are  perfectly  linearly  correlated,  that  is,  when  the  relationship 
(2.1)  applies  for  any  positive  value  of  m  and  c;  r  =  0  occurs  when  they  are  completely 
uncorrelated;  and  r  =  -1  occurs  when  they  are  anti-correlated,  that  is  when  m  is 
negative.  Negative  correlation  coefficients  are  unlikely  to  correspond  to  objects  of 
interest,  since  the  signals  from  the  two  frequencies  should  both  rise  on  encountering  a 
strong  reflector  and  both  fall  in  a  shadow  zone,  although  generally  to  different  extents. 
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As  it  stands,  the  correlation  coefficient  is  not  particularly  useful,  since  it  is  found  that 
neither  minelike  objects  nor  strong  shadows  tend  to  correspond  to  the  most  strongly 
correlated  regions  in  the  sonograms.  If  the  target  region  contains  strong  shadow,  then 
the  signal  level  is  minimal  and  is  dominated  by  noise  which  is  not  correlated  between 
the  two  data  sets,  so  the  value  of  r  is  small.  In  the  case  of  a  strong  reflector,  two 
factors  work  against  large  values  of  the  correlation  coefficient.  Firstly,  constructive 
and  destructive  interference  induces  a  complicated  reflected  soimd  field  which  varies 
greatly  over  small  angular  separations.  In  this  case,  the  variation  in  return  from  a 
reflective  object  has  the  character  of  multiplicative  noise  -  this  tends  to  increase  the 
standard  deviations  and  Cy  in  the  neighbourhood  of  extended  reflective  objects. 

Since  the  soimd-field  is  frequency-dependent,  the  correlation  between  the  data  sets  is 
reduced.  Secondly,  the  dynamic  range  between  the  background  and  strong  acoustic 
reflectors  is  so  large  that  the  signals  from  strong  reflectors  tend  to  saturate  or  "cUp".  If 
the  target  region  is  clipped  in  both  data  sets,  then  the  correlation  may  increase, 
however  if  only  one  is  clipped,  which  often  occurs,  the  correlation  decreases. 

What  is  required  is  a  measure  which  takes  accoxmt  of  the  property  "a"  mentioned 
previously,  that  minelike  objects  tend  to  have  quite  different  mean  signal  levels  than 
their  surroxmdings. 


2.1.2  A  modified  correlation  coefficient 

It  was  discovered  by  experiment  that  a  modification  of  the  correlation  coefficient 
expression  (2.3)  gives  a  more  useful  coefficient  than  the  exact  expression.  Expression 
(2.3)  can  be  re-arranged  as 


<(x-<x>)(y-<y>)>  <(;c-<x>)(y-<y  >)> 

r  =  - - I 


{(x-<x>y)(i^y-<y>y 


which  makes  more  obvious  the  fact  that  values  other  than  the  true  means  <x>  and 
<  ^  >  can  be  substituted  into  the  expression,  that  is,  a  pseudo-correlation  coefficient 

can  be  defined  as 


r  =  7 


<(x-X)(y-Y)> 


{(x-xy){(y-rf)r 


(2.6) 


where  X  and  F  have  been  substituted  for  the  means  of  the  data  sets  Jr,-  and  y, . 
Schwarz's  inequality  shows  that  r'  is  still  limited  to  the  range  [-1,1],  whatever  the 
values  of  X  and  Y  .  The  pseudo-correlation  coefficient  becomes  useful  when  X  and 
F  are  taken  to  mean  the  local  background  levels  near  S  in  each  of  the  two  sonograms. 
Thus,  consider  a  region  S'  which  is  larger  than  tiie  target  region  S  and  is  in  some  way 
local  to  S.  The  shape  of  S'  is  not  important,  except  that  it  constitutes  a  "background" 
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against  which  the  smaller  set  S  can  be  compared.  Let  tine  points  in  S'  be  known  as  Xi 
and  Yi,  witih  means  X  and  Y  respectively.  Now,  let  the  means  of  Xi  and  y,  in  S  be 
referred  to  as  the  "target  averages"  and  means  of  X,  and  Y,  be  referred  to  as  the 
"background  averages".  Define  the  differences  between  the  target  and  background 
averages  (the  "mean  offsets")  in  each  sonogram  by 


a=<x>  -X 
P=<y>-Y 


(2.7) 


then  substituting  for  X  and  X  in  (2.6)  and  simplifying,  we  have 

rarefy  +  ap 


r  = 


[cjI  +a^\cTl  +P^)] 


(2.8) 


Inspecting  expression  (2.8),  we  can  see  that  it  can  be  written  in  terms  of  two  "angles", 
say  4  and  ,  which  show  the  relationships  between  the  respective  target  standard 
deviations  and  the  offsets  of  the  target  means  from  the  background  means. 


Figure  1.  Relationships  between  the  "angles"  4  hirget  standard  deviations  and 

ay  and  mean  offsets  a  and  ff. 


Figure  1  shows  the  significance  of  the  angles,  which  allow  expression  (2.8)  to  be 
written  as 


r '  =  r  cos  4  cos  4  +  sin  4  sin  4  (2.9) 

where  r' ,  r  and  the  sinusoidal  quantities  all  lie  in  the  interval  [-1,  1].  Some 
observations  can  be  made  about  the  mapping: 
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a)  The  target  standard  deviations  and  are  positive  semi-definite,  so  that 

the  cosines  of  each  angle  must  also  be,  that  is,  the  angles  4  ^ 

the  range  [- ;r/2,  ;r/2] .  Thus,  increasing  pixel-by-pixel  correlation  r  in  the 
target  region  always  increases  the  value  of  r' . 

b)  When  4  and  are  both  of  small  magnitude,  then  the  cosine  terms  will  be 
close  to  +1  and  the  sine  terms  wiU  be  small.  In  this  case,  r'  wiU  be  close  to  r . 

c)  When  4  and  ^  both  have  magnitudes  close  to  nfl,  then  the  cosine  terms 
become  small  and  the  sine  terms  dominate.  In  this  circumstance,  the  value  of  r 
is  of  negligible  importance. 

d)  When  only  one  of  4  and  4  has  a  magnitude  close  to  ;r/2 ,  then  the  value  of  r 
is  likely  to  be  important  to  the  value  of  r' . 

Considering  the  points  with  regard  to  the  use  of  the  operator  in  highlighting  objects  of 
interest: 

i)  Point  "a"  implies  that  the  coefficient  is  well-behaved  with  regard  to  correlation  ■ 
in  the  target  region,  since  r'  increases  with  increasing  correlation.  This  is  a 
necessary  feature. 

ii)  Point  "b"  applies  when  a  and  p  are  small  in  comparison  to  cr,  and  G^ 

respectively,  that  is,  when  the  target  mean  offsets  from  the  backgroimd  are 
smaller  than  the  standard  deviations  in  the  target  regions.  This  wiU  apply 
when  the  target  region  contains  die  normal  "background",  in  which  case,  the 
magnitude  of  r'  wiU  depend  on  the  correlation  r  inside  the  target  region.  In 
general,  the  correlation  between  the  data  sets  increases  when  the  target  region 
contains  a  strong  reflector,  which  is  not  the  case  for  background  regions.  Thus, 
the  coefficient  tends  to  suppress  backgrotmd  regions. 

iii)  Point  "c"  appUes  when  a  and  are  large  in  comparison  to  g^  and  G^ 

respectively,  that  is,  when  the  target  mean  offsets  from  the  backgroxmd  are 
larger  than  the  standard  deviations  in  the  target  regions.  In  this  case,  as  noted, 
r'  can  be  large  regardless  of  whether  there  is  significant  correlation  between 
the  two  target  regions.  Thus,  the  coefficient  can  enhance  both  strong  reflectors 
and  strong  shadow  regions. 

2.1.3  Making  a  useful  filter 

The  criteria  given  in  the  preceding  section  make  it  clear  that  if  the  target  regions  S  and 
S'  are  chosen  correctly,  &en  the  pseudo-correlation  coefficient  should  discriminate  in 
favour  of  target  regions  with  strong  correlation  between  the  data  sets  and  large  offsets 
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between  the  target  and  backgrotmd  mean  values.  In  order  to  maximise  the  likely 
offset  between  backgrotmd  and  target  mean  values,  the  background  region  should  be 
chosen  large  enough  for  variations  in  the  target  region  to  have  negligible  impact  on  the 
background  mean,  but  small  enough  to  respond  to  changes  on  scales  much  larger  than 
likely  objects  of  interest.  Similarly,  the  target  region  should  be  chosen  to  be  as  compact 
as  possible  in  order  to  maximise  resolution,  but  sufficiently  large  to  encompass  a 
significant  portion  of  the  typical  target  size  and  establish  meaningful  values  for  the 
means  and  standard  deviations. 

Due  to  the  nature  of  sonar  signals,  the  pseudo-correlation  coefficient  alone  is  not 
sufficient  to  jdeld  a  useful  sonogram  because  of  two  limitations.  Firstly,  the  resolution 
of  the  pseudo-correlation  coefficient  is  of  the  same  order  as  the  size  of  the  target 
region,  which  is  typically  many  pixels  in  size.  Thus,  tire  sharp  edges  which  are  a  visual 
cue  to  the  presence  of  a  minelike  object  can  be  lost.  Secondly,  background  regions  and 
non-minelike  objects  can  register  with  significant  values  of  r' ,  since  the  data  sets  from 
such  regions  may  be  well-correlated  even  if  there  is  not  large  offset  from  the  mean. 

The  image  is  filtered  on  a  pixel-by-pixel  basis.  At  each  pixel,  three  quantities  are 
evaluated:  the  background  levels;  the  pseudo-correlation  correlation  coefficient  r'  for 
the  target  region  S  centred  on  the  pixel;  and  the  differences  of  the  pixel  value  in  each 
data  set  from  their  respective  background  values.  In  order  to  discriminate  against 
objects  which  do  not  stand  out  against  the  backgroxmd,  the  pseudo-correlation 
coefficient  r'  is  squared.  Typically,  pixels  corresponding  to  minelike  objects  have 
somewhat  higher  values  of  r'  than  pixels  corresponding  to  well-correlated  areas  of  the 
backgroimd.  Thus,  squaring  r'  increases  the  separation  between  the  values,  tending 
to  suppress  background  features.  The  filtered  pixel  value  is  the  square  of  r', 
multiplied  by  the  greatest  of  the  two  differences  from  the  mean  and  added  to  the 
desired  overall  filtered  backgroxmd  level,  say  half  the  maximum  pixel  level. 

The  consideration  of  the  difference  from  backgroxmd  at  each  pixel  position  has  the 
result  that  sharp  edges  tend  to  stay  sharp.  It  also  means  that  large  strong  reflectors 
tend  to  be  suppressed,  since  the  their  size  is  large  enough  to  raise  the  backgroxmd 
level. 

It  has  been  mentioned  above  that  it  is  not  xmusual  to  see  evidence  of  distinct,  across- 
track  fxmctional  variations  in  the  backgroxmd  signal  level.  These  variations  arise  from 
mismatches  between  required  and  actual  sonar  TVG,  and  they  tend  to  be  most 
noticeable  close  to  the  centre  of  the  sonar  swath.  In  this  area  the  mismatches 
occasionally  result  in  rapid  across-track  variations  in  backgroxmd  signal  level,  with  the 
pattern  of  the  variations  being  similar  for  the  two  frequencies.  Thus,  considering 
equation  (2.9),  the  value  of  r  can  be  quite  high.  Because  the  target  region  is  generally 
quite  small,  and  assuming  that  there  are  no  targets  in  the  sonar  record,  even  in  the 
presence  of  a  TVG  mismatch  the  values  of  and  wiU  generally  be  quite  small.  The 

result,  through  the  cosine  term  of  equation  (2.9),  will  be  a  relatively  large  value  of  r' 
with  the  net  effect  that  the  filtered  image  may  appear  to  be  'banded'  in  the  region  of 
the  midline.  This  problem  can  be  corrected  by  making  a  simple  modification  to 
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equation  (2.6),  where  the  local  background  means  for  the  square  region  S,  X  and  Y , 
are  replaced  by  a  series  of  column-specific  backgroimd  means, 
y^{hQ,k'\,  with  Aq  representing  the  row  coordinate  of  the  centre  of  region  S.  An 
example  of  a  suitable  method  for  calculating  and  is  included  in  the  next 
section.  The  resulting,  modified,  pseudo-correlation  coefficient  then  becomes 


<{x-Xs)iy-yB)> 


(^x-xsf)l^y-ysf) 


(2.10) 


It  is  worth  emphasising  that,  because  the  test  region  S  is  generally  qtiite  small  and 
rapidly  changing,  local  variations  in  backgrotmd  level  are  relatively  tmusual,  the  two 
pseudo-correlation  coefficients  (2.6)  and  (2.10)  wdl  generaUy  behave  almost 
identically.  An  additional  advantage  of  using  the  modified  pseudo-correlation 
coefficient  is  that  a  "circular  buffer"  approach  can  be  adopted  for  the  calculation  of  its 
components,  allowing  a  worthwhile  reduction  in  computation  time. 


2.2  Filter  implementation 

Let  VJ  be  the  "merged"  sonogram,  tiiat  is,  the  output  of  the  filtration  process  when 
given  input  sonograms  11  and  V,  with  pixels  w[h,k]. 

In  the  current  filter  implementation,  the  search  or  target  region  S  is  a  "square",  say 

s,..\  +s,k  =  kQ-  s,...kg  +  s}  (2.11) 

where  the  size  of  the  "square"  is  determined  by  the  integer  "s",  which  is  greater  than  or 
equal  to  one.  Thus,  the  smallest  squares  are  3x3,  5x5,  and  so  on.  This  shape  was 
chosen  for  ease  of  implementation  and  because  a  minelike  object  is  generally 
rectangular  on  the  seabed,  but  with  rmknown  orientation.  The  main  control  of  filter 
behaviour  is  the  parameter  "s". 

As  implemented,  the  filtration  process  required  to  produce  the  filtered  value  of  the 
sonogram  at  location  three  steps: 

1.  Calculate  the  "backgrormd  levels"  of  the  two  sonograms  across  the  width  of  the 
target  square.  For  convenience,  this  is  done  at  a  prior  stage  in  the  calculation  for 
all  pixel  positions  along  the  row,  using  simple  low-pass  infinite  impulse-response 
(HR)  (recursive)  filters  along  each  colrunn  of  the  sonograms,  with  the  form: 
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yB{K>^i'\  =  ]^y\-KA'\^iL-\)ys[h^-\,  ]] 


(2.12) 


where  I  is  the  range  coordinate  of  the  current  column  and  L  controls  the  "length" 
of  the  filter.  Thus,  the  backgroimd  region  S'  mentioned  in  section  2.1.2  is  actually 
calculated  as  25  + 1  weighted  backgroxmd  levels,  one  for  each  column  of  pixels  up 
to  the  current  ping.  The  effective  size  of  S'  is  determined  by  L.  A  series  of 
"running  averages"  along  the  columns  coidd  also  be  used,  but  would  incur  a 
computational  time  penalty.  If  it  is  known  tiiat  there  will  be  no  significant  "TVG 
effects",  then  single  background  levels  can  be  calculated  for  each  pixel  location. 
For  instance  averages  may  be  calculated  over  a  two  dimensional  region  of  which  S 
is  a  subset.  In  this  case  the  pseudo-correlation  coefficient  calculation  described  in 
the  next  paragraph  would  use  equation  (2.6). 


2.  Calculate  the  pseudo-correlation  coefficient  r'  of  the  data  set 


{x[h,k],y[h,k]:[h,k]  &S} 


(2.13) 


using  (2.10).  The  ordering  of  the  elements  of  S  is  irrelevant  in  the  calculation.  If  r' 
is  negative,  it  may  be  set  to  zero,  since  anti-correlated  regions  are  not  of  interest, 
however  this  has  little  effect  on  the  resultant  sonogram 


3.  The  merged  value  of  the  sonogram,  vv[/Zo ,  Ag  ] ,  is  given  by 


M{ho,ko]  =  b  +  {r'y  ■  max abs(x[ho M-Xs[K,ko ],y[h^ ,K]-yB [K , K ])  (2.14) 

where  &  is  an  arbitrary,  fixed,  positive  "backgroimd  level",  P  is  a  positive  exponent 
and  "maxabs(p,^)"  is  "that  one  of  p  and  q  whose  absolute  value  is  the  largest".  The 
background  level  is  added  to  allow  the  possibility  of  negative  instances  of  the 
value  selected  by  "maxabs". 

HR  filters  such  as  (2.12)  and  similar  running  summations  are  used  throughout  the  filter 
implementation  to  reduce  run-times  and  because  they  correspond  well  with  the  ping- 
by-ping  nature  of  the  recorded  data. 


2.3  Some  examples 

The  examples  which  are  included  in  this  section  were  calculated  with  the  "standard" 
pseudo-correlation  coefficient  (2.6).  Figure  2  compares  three  views  of  the  same  section 
of  sea  bed  containing  a  mineHke  object:  at  left  is  a  section  of  100  kHz  sonogram,  in  the 
centre  is  the  corresponding  section  of  500  kHz  sonogram  and  at  right  is  the  merged 
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sonogram  with  the  filter  parameters  from  Section  2.3.  The  swath  width  is  100  metres, 
aligned  up  the  page.  The  sonograms  have  been  collected  in  relatively  good  conditions, 
however  the  lower  contrast  in  the  500  kHz  section  and  the  higher  resolution  ensure 
that  the  mine  is  barely  visible.  In  the  merged  image,  the  signatures  of  the  minelike 
object  and  its  attendant  objects  are  clearly  visible,  but  most  of  the  smaller  details  on  the 
bottom  have  been  removed,  one  of  the  most  useful  features  of  this  filter. 

Figure  3  compares  the  values  of  the  true  correlation  coefficient  (2.3)  (bottom  left)  and 
the  pseudo-correlation  coefficient  (2.6)  (bottom  right)  and  the  resultant  merged  inmges 
(top  left  and  top  right,  respectively).  The  minelike  object  is  the  same  one  as  is  depicted 
in  Figure  2.  The  images  associated  with  the  pseudo-correlation  coefficient  are  clearly 
superior  to  those  using  the  true  correlation  coefficient,  in  line  with  experience  during 
the  development  of  the  filter.  Note  also  that  the  "resolution"  of  both  correlation 
coefficient  images  is  limited  by  the  use  of  the  7  by  7  moving  square  filter,  so  that  they 
are  much  less  sharp  than  the  original  images.  The  small  squares  in  the  correlation 
coefficient  images  correspond  to  isolated  spikes  occurring  in  both  sonograms.  When 
such  a  spike  is  inside  the  filter  foregrotmd  region  S,  it  dominates  the  summations  and 
consequently  all  positions  of  S  which  contain  the  spike  as  one  of  their  elements 
registers  as  a  high  coefficient  value.  Thus,  tiie  squares  are  at  minimum  13  by  13  pixels 
in  size. 

Finally,  Figure  4  compares  original  100  and  500  kHz  sonograms  of  a  minetike  object 
collected  in  conditions  of  high  electrical  interference  with  the  merged  sonogram  of  the 
same  section  of  data.  The  merged  sonogram  is  clearly  superior  to  either  of  the 
originals  in  this  case  and  tiie  minelike  object  is  more  visible,  primarily  due  to  the 
reduction  in  the  backgrotmd  noise  level,  rather  than  an  increase  in  the  absolute  pixel 
level. 
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[■lilt 


Figure  2.  A  comparison  of  the  original  corresponding  100  kHz  (left)  and  500  kHz  (centre) 
sections  of  sonogram  and  the  resultant  merged  sonogram  (right).  A  reflective  sonar 
target,  a  float  and  a  multiple  reflection  of  the  float  are  visible  in  all  three  images,  but 
most  of  the  details  of  the  background  are  absent  from  the  merged  image. 
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Figure  3.  A  comparison  of  the  true  correlation  coefficient  (bottom  left)  and  pseudo-correlation 
coefficient  (bottom  right)  in  the  region  of  the  minelike  object  depicted  in  Figure  2. 
The  corresponding  merged  images  resulting  from  the  use  of  each  coefficient  are 
shown  at  top  left  and  top  right  respectively.  The  results  from  the  pseudo-correlation 
coefficient  are  clearly  superior  to  those  from  the  true  correlation  coefficient.  In  this 
case  strong  correlation  is  represented  black  and  weak  correlation  is  represented 
white. 
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Figure  4.  A  comparison  of  corresponding  sections  of  100  kHz  (left),  500  kHz  (centre)  and 
merged  (right)  sonograms  for  a  particularly  noisy  data  set.  The  100  kHz  sonogram 
is  corrupted  by  streaking  due  to  a  faulty  slip-ring  connection  and  by  surface 
reverberation.  The  500  kHz  sonogram  is  corrupted  by  smaller  amounts  of  slip-ring 
noise.  A  minelike  object  is  visible  in  the  500  kHz  and  merged  images,  but  not  in  the 
100  kHz  image  without  processing. 
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2.4  Performance  evaluation 

Personal  experience  with  the  output  of  the  merging  filter  tends  to  suggest  that  it  is 
very  robust,  normally  highlighting  the  presence  of  minelike  objects  at  least  as  clearly 
and  reliably  to  a  human  operator  as  inspection  of  both  the  100  and  500  kHz 
sonograms.  In  many  cases,  minelike  objects  are  easier  to  locate  in  the  merged  image 
than  in  either  of  the  two  original  images,  and  this  is  particularly  the  case  when  one  of 
ihe  original  images  is  strongly  corrupted  by  surface  reverberation. 

In  order  to  provide  some  kind  of  numerical  measure  of  performance,  a  techmque  was 
devised  which  would  estimate  the  "signal  to  noise"  ratio  of  target  (minelike)  objects  in 
the  original  and  merged  sonograms. 

2.4.1  Definition  of  "local  signal  to  noise  ratio"  for  a  target  object  in  a  sonogram 

Consider  a  target  object  appearing  in  a  sonogram.  A  "local  signal  to  noise  ratio  for 
the  object  should  measure  the  amotmt  by  which  the  object  differs  from  its  immediate 
surroxmdings,  since  a  human  observer  is  able  to  discern  such  local  variations  in  the 
image.  The  extent  of  a  target  object  is  taken  to  be  limited  to  a  rectangular  region 
assigned  to  it  by  a  human  operator,  using  a  mouse-controlled  on-screen  pointer  to 
draw  a  box  arotmd  the  object,  where  the  extent  of  the  box  is  shown  by  a  "rubber  band" 
display.  The  size  of  the  box  is  selected  after  comparing  both  original  sonograms  and 
the  merged  sonogram,  and  is  set  to  contain  the  largest  extent  of  the  object  in  any  of  the 
three  sonograms.  The  box  contains  all  of  the  target  pixels  and  may  also  contain  other 
pixels,  depending  on  the  orientation  of  the  object.  The  "background"  is  then  taken  to 
be  two  regions  of  equal  width,  but  half  the  height  of  the  target  region,  immediately 
above  and  below  it.  That  is,  if  the  target  region  T  is  defined  by 


T  —  |[A,  At]:  h  —  —  kQ,...k^ 

then  the  background  regions  and  (upper  and  lower)  are  given  by 

By  =  |[A,A:]:  h  =  h,+  + 

B,  =  |[/2,A:]: h  =  h,  -l,k  = 

The  backgroimd  region  is  thus  B  =  By  +  B^,  which  contains 
Ns  ={h,-h,+l)ik,-k,+l) 


(2.15) 


(2.16) 


(2.17) 


elements,  the  same  number  as  the  target  region.  Labelling  the  elements  of  B  by  z,  the 
average  background  level  and  standard  deviation  are  derived  from 
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'Zp^-nm 


(2.18) 


where  the  are  the  values  of  the  appropriate  soiiogram  (either  of  the  originals  u  or  v, 
or  merged).  The  "n-1"  form  of  the  standard  deviation  is  used  since  the  number  of 
elements  in  B  tends  to  be  small. 

The  average  background  level  and  standard  deviation  are  used  to  determine  a 
threshold  level  for  the  elements  of  the  target  region.  This  is  necessary,  because  usually 
the  target  object  does  not  fill  the  entire  rectangular  target  region  and  the  target  may 
have  an  associated  acoustic  shadow.  The  threshold  value  ©j.  is  set  two  standard 
deviations  higher  than  the  mean  background  level,  that  is, 

©,  =  {;ff)  +  2<7,  (2.19) 

with  a  separate  value  for  each  of  the  three  sonograms.  The  target  region  of  each  of  the 
three  sonograms  is  now  thresholded,  and  elements  with  values  below  the  threshold 
are  discarded.  Let  (t)  be  the  mean  of  those  elements  of  the  target  region  whose  signal 
levels  are  above  the  threshold.  Then,  the  "local  signal  to  noise  ratio"  of  the  target 
region  in  each  sonogram  is  defined  as  the  ratio  of  the  signal  power  to  the  noise  power, 
which  is 


SNR  = 


(2.20) 


2.4.2  Variability  of  the  local  signal  to  noise  ratio 

A  concern  with  the  use  of  the  local  SNR  is  that  a  judgement  as  to  the  Hmits  of  the 
target  area  is  required  from  a  human  operator.  The  procedure  used  to  date  has  been  to 
compare  the  image  of  the  target  object  in  all  three  sonograms,  then  draw  a  box  aroxmd 
it  which  is  large  enough  to  accommodate  the  largest  observed  extent.  In  order  to 
measure  the  effects  of  this,  some  trials  were  done  in  which  SNRs  of  the  same  minelike 
object  were  assessed  repeatedly.  They  are  given  in  Table  1  following.  Each  line  in  the 
table  represents  a  separate  trial  and  the  values  are  grouped  according  to  the  box  size 
condition  which  was  being  assessed.  The  first  number  in  each  coliunn  is  the  local  SNR 
value,  in  dB  and  the  second  number  in  each  column  is  the  number  of  pixels  in  the  box 
registering  above  the  target  threshold. 
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Table  1.  Effects  of  box  size  on  local  SNR  values  for  the  same  object 


Box  Size  Condition 

100  kHz 

500  kHz 

Merged 

normal  size  box 

13.8  (81) 

9.0  (33) 

22.3  (101) 

13.9  (76) 

9.1  (30) 

22.5  (94) 

13.4  (67) 

8.5  (28) 

20.6  (78) 

sized  to  500  kHz  target 

5.1  (32) 

8.1  (24) 

7.0  (22) 

2.6  (25) 

7.8  (22) 

none  (0) 

5.1  (32) 

8.1  (24) 

7.0  (22) 

sized  to  merge  target 

14.4  (93) 

8.6  (32) 

27.0  (128) 

13.9  (83) 

8.3  (30) 

24.5  (116) 

14.2  (90) 

7.9  (29) 

27.8  (128) 

increasing  sizes  (too  small  to 

-4.2  (18) 

3.7  (11) 

none  (0) 

too  big) 

3.2  (32) 

8.3  (22) 

6.7  (5) 

10.0  (58) 

8.4  (28) 

14.4  (67) 

13.8  (77) 

8.9  (28) 

23.2  (97) 

14.2  (83) 

7.1  (25) 

25.6  (110) 

13.9  (98) 

7.6  (34) 

26.2  (135) 

Inspecting  the  table,  it  can  be  seen  that  the  SNR  values  from  different  trials  using 
approximately  the  same  sized  box  usually  give  results  stable  to  within  3  dB.  It  is  also 
clear  that  sizing  die  box  to  the  500  kHz  target  alone  is  not  advisable,  and  that  the  SNR 
values  from  the  original  sonograms  are  much  less  sensitive  to  box  size  than  those  from 
the  merged  sonogram,  presumably  because  the  backgrounds  are  noisier.  Inspecting 
the  number  of  pixels  registering  above  threshold,  it  is  clear  that  as  the  box  size  and 
hence  the  number  of  backgroxmd  pixels  represented  in  the  backgroxmd  standard 
deviation  calculation  increases,  the  target  threshold  decreases  and  more  "target"  pixels 
appear  in  the  calculation.  It  is  likely  diat  box  sizes  far  above  the  "normal"  size  do  not 
lead  to  greatly  different  SNRs.  In  the  evaluations  following,  the  SNR  values  quoted 
are  assumed  to  be  valid  within  a  margin  of  3  dB. 
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3.  Filter  operation  and  performance 

3.1  Analysis  of  Jervis  Bay  data  sets  up  to  1995  with  a  fixed  filter 
parameter 

Local  signal  to  noise  ratios  were  calculated  for  minelike  objects  appearing  in 
sonograms  coUected  during  various  DSTO  sea  trials  in  Jervis  Bay  up  to  1995, 
comprising  191  separate  SNR  evaluations  for  each  of  the  three  sonograms,  too  large  a 
data  set  to  include  in  this  report.  Some  collected  results  are  presented  graphically. 

The  filter,  as  implemented  for  these  evaluations,  used  the  square  of  the  correlation 
coefficient,  and  a  7  by  7  filter  size  was  selected;  that  is,  P=2  and  s=3.  In  this  case  die 
background  level  setting  b  was  not  fixed,  but  was  set  equal  the  overall  average  of  the 
background  levels  of  the  original  sonograms,  as  output  by  the  HR  filters  (2.12). 

Figure  5  shows  the  collected  signal  to  noise  ratios,  in  dB,  calculated  for  all  object  types, 
plotted  against  across-track  range.  The  SNR  values  for  each  sonogram  are  very 
scattered,  but  it  is  immediately  obvious  that  targets  in  the  merged  sonogram  score 
significantly  higher  than  in  either  of  the  two  "raw"  sonograms.  This  agrees  with  the 
experience  of  the  authors  during  numerous  visual  inspections  of  sonograms.  Figure  6 
shows  the  excesses  of  the  SNRs  from  the  merged  sonograms  over  those  from  each  of 
the  two  original  sonograms  for  each  particular  target.  Again,  the  values  are  almost  aU 
positive  and  some  are  very  large.  Detailed  inspection  of  the  data  reveals  that  out  of 
191  separate  mine  and  target  identifications,  there  are  13  instances  (6.8%)  in  which  the 
signal  to  noise  ratio  of  a  target  in  the  merged  sonogram  is  exceeded  by  the  signal  to 
noise  ratio  in  one  or  the  other  of  the  raw  sonograms,  and  of  these,  1  instance  in  which 
it  is  exceeded  in  both  of  the  raw  sonograms. 

Almost  invariably,  the  background  standard  deviation  in  the  merged  images  is  very 
low,  that  is,  there  is  little  noise  power  remaining  in  the  sonogram.  As  a  result  of  this, 
and  recalling  equation  (2.20),  very  large  (frequently  over  30  dB)  signal  to  noise  ratios 
are  a  characteristic  of  many  targets  in  the  merged  sonogram.  The  background  in  the 
merged  sonograms  is  much  less  varied  than  in  the  original  sonograms.  This  is  because 
the  pseudo-correlation  coefficient  in  background  areas  is  usually  less  than  1,  and  it  is 
squared  before  being  multiplied  by  the  local  difference  between  foreground  and 
backgroimd  regions,  which  is  itself  already  a  small  number  when  the  region 
considered  does  not  contain  a  strong  reflector.  Note  that  visual  inspection  of  the 
targets  with  high  SNRs  shows  that  they  are  indeed  very  obvious  objects  in  the 
sonogram. 

Given  the  large  spread  of  values  in  all  the  SNR  evaluations,  it  was  decided  to  use  the 
median  score  for  each  data  set  as  the  best  indication  of  target  visibility.  Table  2  shows 
the  median  scores  for  tiie  entire  data  set  and  for  the  various  subsets  corresponding  to 
different  classes  and  orientations  of  minelike  objects.  The  spread  of  values  is  much 
lower  than  in  Figures  5  and  6,  which  lends  some  credibility  to  the  use  of  a  median 
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score.  The  results  reinforce  the  previous  observations  that  the  merging  filter  makes  the 
mineiike  objects  more  readily  visible  than  they  were  in  the  original  images.  It  also 
seems  to  show  fiiat  in  general,  the  type  of  minelike  object  under  consideration  makes 
relatively  little  difference  to  the  performance  of  the  filter,  or  even  to  the  visibility  of  the 
mine  in  the  original  sonograms. 


Table  2.  Median  signal  to  noise  ratios  in  dBfor  sea-trial  data  sets  to  1995 


Target  type 

100  kHz 

500  kHz 

merged 

aU  possible 

11.4 

11.9 

18.2 

DSTO  reflector  1 

13.1 

15.9 

23.1 

DSTO  reflector  2 

14.3 

15.5 

21.8 

minelike  la 

14.4 

10.6 

23.1 

mineiike  lb 

14.9 

14.7 

18.0 

minelike  Ic 

9.2 

12.1 

18.2 

mineiike  Id 

9.9 

11.4 

14.7 

mineiike  2 

14.4 

11.5 

26.3 

mineiike  3a 

14.6 

11.4 

22.8 

mineiike  3b 

15.8 

10.2 

24.8 

mineiike  3c 

6.9 

11.4 

17.4 

mineiike  3d 

8.4 

11.3 

14.0 

mineiike  4 

16.4 

13.8 

24.2 
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Figure  5.  A  comparison  of  local  signal  to  noise  ratios  (SNRs)  for  minelike  objects  observed 
during  DSTO  sea  trials  recorded  over  a  period  of  3  years  to  1995.  In  each  instance, 
the  image  of  the  minelike  object  was  inspected  in  all  three  sonograms,  and  the 
evaluation  box  for  the  local  SNR  was  drawn  so  as  to  contain  the  largest  of  the 
images,  the  same  box  being  used  for  all  three  images.  The  data  were  collected  in  a 
range  of  weather  conditions  and  with  many  dijferent  types  of  minelike  object,  but 
SNR  values  for  the  merged  sonograms  are  consistently  larger  than  for  either  of  the 
original  sonograms,  and  this  is  true  regardless  of  range. 
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range  across  track,  m 

Figure  6.  Differences  between  the  local  SNR  values  from  the  merged  sonograms  and  the 
original  100  and  500  kHz  sonograms  for  the  191  SNR  evaluations  from  DSTO  sea- 
trial  data  whose  absolute  SNR  values  are  depicted  in  Figure  4.  Most  values  are 
positive,  indicating  that  the  merged  sonogram  scored  higher  than  the  original 
sonogram. 

3.2  Analysis  of  Jervis  Bay  1996  data  sets  collected  with  different 
towfish  configurations 

After  sea  trials  in  Jervis  Bay  in  1996,  further  SNR  data  became  available,  this  time 
involving  only  a  two-week  period  of  data  collection.  Data  was  analysed  in  the  same 
manner  as  the  earlier  trials,  except  that  in  this  case,  the  merge  data  background  setting 
b  was  set  to  a  fixed  value  of  100.  In  this  case,  the  sonar  configuration  was  subject  to 
numerous  changes  dxiring  the  trial,  as  different  baffles  were  attached  to  the  towfish  in 
order  to  deliberately  change  the  sotmd  field  entering  the  transducers  from  surface 
reverberation.  After  inspection  of  the  data,  it  was  not  possible  to  distinguish  a 
consistent  effect  on  the  SNR  values  due  either  to  the  different  towfish  configurations, 
nor  to  the  different  reflective  targets.  Table  3  shows  the  median  SNR  values  from  all 
data  sets,  and  for  each  of  the  separate  DSTO-constructed  reflective  targets.  Note  that 
there  is  as  much  variation  between  the  median  SNR  values  for  targets  1  and  2,  which 
are  supposedly  identical,  as  there  is  between  the  rest  The  SNRs  for  the  merged 
sonogram  are  however  consistently  higher  than  for  the  original  sonograms. 
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Table  3.  Median  signal  to  noise  ratios  in  dB  from  Jervis  Bay  sea  trials,  1996 


Data  set 

100  kHz 

500  kHz 

merged 

all  possible 

10.9 

8.3 

18.8 

DSTO  reflector  1 

8.6 

7.5 

16.4 

DSTO  reflector  2 

10.2 

8.7 

18.1 

DSTO  reflector  3 

11.4 

8.4 

19.9 

DSTO  reflector  4 

10.9 

8.1 

19.5 

Figure  7  shows  the  local  SNR  values  calculated  for  the  entire  data  set.  The  results  are 
similar  to  those  depicted  in  Figure  5  and  again  the  merged  sonogram  scores 
consistently  higher  than  the  original  sonograms.  As  the  results  for  the  merge  data  are 
comparable  to  those  of  earlier  trials,  the  choice  of  background  setting  is  clearly  not 
critical  to  the  performance  of  the  filter. 


Figure  7.  A  comparison  of  local  signal  to  noise  ratios  (SNRs)  for  minelike  objects  observed 
during  DSTO  sea  trials  recorded  during  a  two-week  period  in  1996.  The  procedure 
for  evaluating  the  SNRs  was  the  same  as  is  Figure  4.  In  contrast  with  Figure  4,  the 
data  in  this  figure  were  collected  with  different  configurations  of  towfish 
attachments  intended  to  block  the  path  of  sound,  coming  from  above  the  towfish, 
into  the  transducers.  Once  again,  however,  the  SNR  values  for  the  merged 
sonograms  are  consistently  higher  than  those  of  either  of  the  original  sonograms, 
and  this  is  true  regardless  of  range. 
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3.3  Why  the  filter  works 

Considering  Section  2.1.2,  we  see  that  the  filter  is  controUed  by  the  true  correlation 
coefficient  t  of  the  100  and  500  kHz  sonograms,  and  by  the  "angles  ^  and  ^ ,  which 

compare  the  differences  of  the  target  and  background  means  with  the  respective  target 
standard  deviations.  Minelike  objects  are  not  generally  associated  with  the  highest 
values  of  the  true  correlation  coefficient  registering  in  the  image,  which  makes  the 
value  of  r  relatively  unimportant  in  the  calculation.  In  contrast,  inspection  of  the  target 
and  backgroxmd  means  and  the  target  standard  deviations  corresponding  to  areas 
containing  minelike  objects  shows  that  the  differences  between  the  means  are  mostly 
several  timps  larger  than  the  standard  deviations.  That  is,  it  is  usually  true  that  at  least 
one  of  4  and  is  large  in  the  vicinity  of  a  minelike  object  and  they  are  rarely  of 

opposite  sign,  so  the  pseudo-correlation  coefficient  expression  is  dominated  by  the 
product  of  sines.  The  value  of  the  pseudo-correlation  coefficient  is  usually  close  to  1  at 
minelike  objects. 

Some  factors  dependent  on  the  nature  of  the  Klein  590  dual-frequency  sidescan  sonar 
appear  to  assist  the  operation  of  the  filter  and  allow  it  to  be  used  at  aU  ranges  across 
the  swath,  which  is  taken  here  to  be  100  m  wide. 

Firstly,  at  ranges  above  50  m,  the  high  rates  of  attenuation  associated  with  the  500  kHz 
frequency  mean  that  it  is  usually  noise-limited,  such  that  highly  reflective  objects 
register  above  the  noise  levels,  but  the  less  reflective  common  features  of  the  bottom 
do  not.  Hence,  r'  is  often  small  ever5rwhere  except  in  the  region  of  a  target,  since  there 
is  neither  high  true  correlation,  nor  significant  difference  from  the  backgrotmd. 

Secondly,  the  beam  of  the  500  kHz  transducer  is  less  receptive  to  energy  incident  from 
above  the  horizontal  than  is  tiie  100  kHz  transducer.  In  data  collected  in  shallow 
water,  this  means  that  surface  reverberation  (wave  clutter)  appearing  m  the  100  kHz 
sonogram  is  often  much  less  evident  or  absent  from  the  500  kHz  sonogram,  which 
allows  the  filter  to  reduce  the  impact  of  surface  reverberations  on  the  merged 
sonogram.  This  is  most  apparent  at  large  ranges  across  track. 


Finally,  considering  the  heavily  streaked  sonograms  in  Figure  4,  the  merging  filter  has 
been  successful  in  this  case  because  the  noise  levels  in  the  sonograms  were 
uncorrelated,  so  that  r  was  small,  and  4  and  4  were  also  not  correlated,  allowing  the 
filter  to  successfully  discriminate  against  the  noise. 
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4.  Conclusion 

This  report  has  demonstrated  a  filtration  process  which  may  be  successfully  and 
routinely  used  to  reduce  the  impact  of  high  noise  levels  in  dual-frequency  sonograms. 
It  has  been  implemented  as  a  PC-based  computer  program  which  is  capable  of  real¬ 
time  operation  and  is  currently  under  assessment  by  personnel  of  the  Royal  Australian 
Navy.  The  filter  is  relatively  simple  to  implement  and  offers  considerable  benefit  in 
practice. 


23 


DISTRIBUTION  LIST 


"Merge"  -  A  Filter  for  the  Fusion  of  Dual-Frequency  Sidescan  Sonar  Data 
Roger  A.  Neill  and  Stuart  D.  Anstee 
AUSTRALIA 


1.  DEFENCE  ORGANISATION 

a.  Task  Sponsor  DGFD  (Sea)  -  DDMWD 

b.  S&T  Program 

Chief  Defence  Scientist  1 

FAS  Science  Policy  }  shared  copy 

AS  Science  Corporate  Management  J 

Director  General  Science  Policy  Development 
Counsellor  Defence  Science,  London  (Doc  Data  Sheet ) 

Counsellor  Defence  Science,  Washington  (Doc  Data  Sheet ) 

Scientific  Adviser  to  MRDC  Thailand  (Doc  Data  Sheet ) 

Director  General  Scientific  Advisers  and  Trials/Scientific  Adviser  Policy  and 
Command  (shared  copy) 

Navy  Scientific  Adviser 

Scientific  Adviser  -  Army  (Doc  Data  Sheet  and  distribution  list  only) 

Air  Force  Scientific  Adviser 
Director  Trials 

Aeronautical  and  Maritime  Research  Laboratory 
Director 

Chief  of  Maritime  Operations  Division 

Alan  Theobald 

Brian  Ferguson 

Roger  Neill 

Stuart  Anstee 

John  Best 

Kurt  Benke 

David  Hedger 

Electronics  and  Surveillance  Research  Laboratory 
Director  (Doc  Data  Sheet  only) 

Tim  Payne,  LSOD 

DSTO  Library 
Library  Fishermens  Bend 
Library  Maribymong 
Library  Salisbury  (2  copies) 

Australian  Archives 
Library,  MOD,  Pyrmont 

c.  Forces  Executive 

Director  General  Force  Development  (Land)  (Doc  Data  Sheet  only) 


d.  Navy 

SO  (Science),  Director  of  Naval  Warfare,  Marititne  Headquarters  Annex, 
Garden  Island,  NSW  2000.  (Doc  Data  Sheet  only) 

COMAUSMINDIVFOR 
COMAUSMINDIVFOR-MWRSO 
Operations  Manager,  (MCD),  MWSC 
DNW,  Attn  LCDR  Peter  Stokes 

Hydrographies  Projects  Officer,  Hydrographies  Projects  Office 
MHD  c/-  Hydro  Office,  8  Station  St,  Wollongong 

CMDR  Paul  Spencer,  CO  HMAS  Moresby,  Warship  Section 
Perth  Mail  Exchange,  WA  6758 

e.  Army 

ABCA  Office,  G-1-34,  Russell  Offices,  Canberra  (4  copies) 

g.  S&I  Program 

Defence  Intelligence  Organisation 

Library,  Defence  Signals  Directorate  (Doc  Data  Sheet  only) 

1.  B&M  Program  (libraries) 

OIC  TRS,  Defence  Regional  Library,  Canberra 

Officer  in  Charge,  Document  Exchange  Centre  (DEC),  1  copy 

*US  Defence  Technical  Information  Center,  2  copies 

*UK  Defence  Research  Information  Centre,  2  copies 

*Canada  Defence  Scientific  Information  Service,  1  copy 

*NZ  Defence  Information  Centre,  1  copy 

National  Library  of  Australia,  1  copy 

2.  UNIVERSITIES  AND  COLLEGES 

Austrahan  Defence  Force  Academy 
Library 

Head  of  Aerospace  and  Mechanical  Engineering 

Deakin  University,  Serials  Section  (M  hst),  Deakin  University  Library,  Geelong,  3217 
Senior  Librarian,  Hargrave  Library,  Monash  University 
Librarian,  Flinders  University 

3.  OTHER  ORGANISATIONS 

NASA  (Canberra) 

AGPS 

OUTSIDE  AUSTRALIA 

4.  ABSTRACTING  AND  INFORMATION  ORGANISATIONS 
INSPEC:  Acquisitions  Section  Institution  of  Electrical  Engineers 
Library,  Chemical  Abstracts  Reference  Service 
Engineering  Societies  Library,  US 

Materials  Information,  Cambridge  Scientific  Abstracts,  US 
Documents  Librarian,  The  Center  for  Research  Libraries,  US 


5.  INFORMATION  EXCHANGE  AGREEMENT  PARTNERS 
Acquisitions  Unit,  Science  Reference  and  Information  Service,  UK 
Library  -  Exchange  Desk,  National  Institute  of  Standards  and  Technology,  US 

TTCP  GTP-13  National  Leaders 
USA-  Doug  Todoroff 
Canada  -  Ron  Kuwahara 
NZ  -  Reinhard  Holtkamp 
UK  -  John  Wickenden 


SPARES  (10  copies) 

Total  number  of  copies:  72 


Page  classification:  UNCLASSMHD 


_  CD  .  - 

DEFENCE  SCIENCE  AND  TECHNOLOGY  ORGANISATION 
DOCUMENT  CONTROL  DATA 

1.  PRIVACY  MARKING/CAVEAT  (OF 

DOCUMENT) 

2.  TITLE 

"Merge"  -  A  Filter  for  the  Fusion  of  Dual-Frequency  Sidescan 
Sonar  Data 

3.  SECURITY  CLASSIFICATION  (FOR  UNCLASSIFIED  REPOKl  S 
THAT  ARE  LIMITED  RELEASE  USE  (L)  NEXT  TO  DOCUMENT 
CLASSIFICATION) 

Document  (U) 

Title  (U) 

Abstract  (U) 

4.  AUTHOR(S) 

Roger  A.  Neill  and  Stuart  D.  Anstee 

5.  CORPORATE  AUTHOR 

Aeronautical  and  Maritime  Research  Laboratory 

PO  Box  4331 

Melbourne  Vic  3001 

6a.  DSTO  NUMBER  6b.  AR  NUMBER 

DSTO-TR-0511  AR-010-173 

6c.  TYPE  OF  REPORT  7.  DOCUMENT  DATE 

Technical  Report  March  1997 

8.  FILE  NUMBER  9.  TASK  NUMBER  10.  TASK  SPONSOR 

510/207/0636  ADS  94/193  DGFD  (Sea) 

11.  NO.  OF  PAGES  12.  NO.  OF 

23  REFERENCES 

13.  DOWNGRADING/DELIMITING  INSTRUCTIONS 

None 

14.  RELEASE  AUTHORITY 

Chief,  Maritime  Operations  Division 

15.  SECONDARY  RELEASE  STATEMENT  OF  THIS  DOCUMENT 

Approved  for  public  release 

OVERSEAS  ENQUIRIES  OUTSIDE  STATED  LIMITATIONS  SHOULD  BE  REFERRED  THROUGH  DOCUMENT  EXCHANGE  CENTRE,  DIS  NETWORK  OFFICE, 
DEFT  OF  DEFENCE,  CAMPBELL  PARK  OFFICES,  CANBERRA  ACT  2600 

16.  DELIBERATE  ANNOUNCEMENT 

No  Limitations 

17.  CASUAL  ANNOUNCEMENT  Yes 

18.  DEFTEST  DESCRIPTORS 

Image  processing;  Data  fusion;  Route  surveillance;  Sidescan  sonar;  Target  detection 

19.  ABSTRACT 

A  filtering  and  data  fusion  technique  is  described  which  uses  the  correlation  between  the  two  data 
streams  of  a  dual-frequency  sidescan  sonar  in  order  to  discriminate  against  noise  and  preferentially 
enhance  features  of  desired  relative  size.  The  filter  is  applied  to  data  collected  with  a  Klein  590  dual¬ 
frequency  sidescan  sonar  with  the  parameters  adjusted  so  as  to  enhance  the  display  of  minelike  objects. 
Experience  with  the  filter  shows  that  it  is  highly  effective  in  noisy  conditions. 

Page  classification;  UNCLASSIFIED 


DSTO# 


AERONAUTICAL  AND  MARITIME  RESEARCH  ^BORATORY 
GPO  BOX  4331  MELBOURNE  VICTORIA  3001 
AUSTRALIA,  TELEPHONE  (03)  9626  7000 


TECHNICAL  REPORT  DSTO-TR-0511  AR-010-173  MARCH  1997 


